NM_198185 ACCESSION :NM_198185 NID: gi 38044105 ref NM_198185.1 
Homo sapiens oviductin protease (OVTN) , mRNA 
Length = 1695 

Score = 619 bits (1580), Expect = e-175 

Identities = 300/302 (99%), Positives = 300/302 (99%), Gaps = 0/302 (0%) 
Frame = +1 

Query: 5 MLISRNKLILLLGIVFFERGKSAALSLPKAPSCGQSLVKVQPWNYFNIFSRILGGSQVEK 62 

MLISRNKLILLLGIVFFERGKSA LSLPKAPSCGQSLVKVQPWNYFNIFSRILGGSQVEK 
Sbjct: 1 MLISRNKLILLLGIVFFERGKSATLSLPKAPSCGQSLVKVQPWNYFNIFSRILGGSQVEK 180 

Query: 63 GSYPWQVSLKQRQKHICGGSIVSPQWVITAAHCIANRNIVSTLNVTAGEYDLSQTDPGEQ 122 

GSYPWQVSLKQRQKHICGGSIVSPQWVITAAHCIANRNIVSTLNVTAGEYDLSQTDPGEQ 
Sbjct: 181 GSYPWQVSLKQRQKHICGGSIVSPQWVITAAHCIANRNIVSTLNVTAGEYDLSQTDPGEQ 360 

Query: 123 TLTIETVIIHPHFSTKKPMDYDIALLKMAGAFQFGHFVGPICLPELREQFEAGFICTTAG 182 

TLTIETVIIHPHFSTKKPMDYDIALLKMAGAFQFGHFVGPICLPELREQFEAGFICTTAG 
Sbjct: 361 TLTIETVIIHPHFSTKKPMDYDIALLKMAGAFQFGHFVGPICLPELREQFEAGFICTTAG 540 

Query: 183 WGRLTEGGVLSQVLQEVNLPILTWEECVAALLTLKRPISGKTFLCTGFPDGGRDACQGDS 242 

WGRLTEGGVLSQVLQEVNLPILTWEECVAALLTLKRPISGKTFLCTGFPDGGRDACQGDS 
Sbjct: 541 WGRLTEGGVLSQVLQEVNLPILTWEECVAALLTLKRPISGKTFLCTGFPDGGRDACQGDS 720 

Query: 243 GGSLMCRNKKGAWTLAGVTSWGLGCGRGWRNNVRKSDQGSPGIFTDISKVLSWIHEHIQT 3 02 

GGSLMCRNKKGAWTLAGVTSWGLGCGRGWRNNVRKSDQGSPGIFTDISKVL WIHEHIQT 
Sbjct: 721 GGSLMCRNKKGAWTLAGVTSWGLGCGRGWRJSIIWRKSDQGSPGIFTDISKVLPWIHEHIQT 900 

Query: 303 GN 304 
GN 

Sbjct: 901 GN 906 
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NM_198185 1695 bp 

Homo sapiens oviductin protease 
NM„198185 

NM 198185.1 GI:38044105 



mRNA 
(OVTN) , 



linear 
mRNA. 
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gene 



CDS 



sig_peptide 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Puente,X.S., Sanchez , L .M. , Overall, CM. and Lopez-Otin, C . 

Human and mouse proteases: a comparative genomic approach 

Nat. Rev. Genet. 4 (7), 544-558 (2003) 

22722134 

12838346 

PROVISIONAL REFSEQ: This record has not yet been subject to final 
NCBI review. The reference sequence was derived from BN000120 . 1 . 

Location/Qualifiers 

1. .1695 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/ db_xr e f = " t axon : 9 6 0 6 " 

/ chr omo s ome ="11" 

/map="llpl5.4" 

1. .1695 

/gene="OVTN" 

/db_xref=" Locus ID: 341272" 

1. .1695 

/gene="OVTN" 

/function=" serine protease" 
/codon_start=l 

/product=" oviductin protease" 
/protein_id= " NP_937828 . 1 " 
/db_xref="GI: 38044106" 
/ db_xre f =" Locus ID : 341277 " 

/translation="MLISRNKLILLLGIVFFERGKSATLSLPKAPSCGQSLVKVQPWN 
YFNIFSRILGGSQVEKGSYPWQVSLKQRQKHICGGSIVSPQWVITAAHCIANRNIVST 
LNVTAGEYDLSQTDPGEQTLTIETVIIHPHFSTKKPMDYDIALLKMAGAFQFGHFVGP 
ICLPELREQFEAGFICTTAGWGRLTEGGVLSQVLQEVNLPILTWEECVAALLTLKRPI 
SGKTFLCTGFPDGGRDACQGDSGGSLMCRNKKGAWTLAGVTSWGLGCGRGWRNNVRKS 
DQGSPGIFTDISKVLPWIHEHIQTGNRRKSSRAWCSEQDVIVSGAEGKLHFPESLHLY 
YESKQRCVWTLLVPEEMHVLLSFSHLDVESCHHSYLSMYSLEDRPIGKFCGESLPSSI 
LIGSNSLRLKFVSDATDYAAGFNLTYKALKPNYIPDSGCSYLTVLFEEGLIQSLNYPE 
NYSDKANCDWIFQASKHHLIKLSFQSLEIEESGDCTSDYVTVHSDVERKKEIARLCGY 
DVPTPVLSPSSIMLISFQSDENGTCRGFQATVSFIPKAGKKIELPTLWFPVLILVM" 
1. .69 

/gene="OVTN" 
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mat_peptide 70. .1692 

/gene="OVTN" 

/product=" oviduct in protease" 
misc_f eature 151.. 882 

/gene="OVTN" 

/note="Tryp_SPc; Region: Trypsin-like serine protease" 
/ db_xr e f = " C DD : smart00020 " 
inisc„f eature 976.. 1254 

/gene="OVTN" 

/note="CUB; Region: CUB domain" 
/ db_xr e f = " CDD : pfam00431 " 
misc^f eature 1321.. 1617 

/gene="OVTN" 

/note="CUB; Region: CUB domain" 
/ db_xr e f = " CDD : pfam00431 " 

ORIGIN 

1 atgcttataa gcaggaacaa gctgatttta ctactaggaa tagtcttttt tgaacgaggt 
61 aaatctgcaa ctctttcgct ccccaaagct cccagttgtg ggcagagtct ggttaaggta 
121 cagccttgga attattttaa cattttcagt cgcattcttg gaggaagcca agtggagaag 
181 ggttcctatc cctggcaggt atctctgaaa caaaggcaga agcatatttg tggaggaagc 
241 atcgtctcac cacagtgggt gatcacggcg gctcactgca ttgcaaacag aaacattgtg 
301 tctactttga atgttactgc tggagagtat gacttaagcc agacagaccc aggagagcaa 
3 61 actctcacta ttgaaactgt catcatacat ccacatttct ccaccaagaa accaatggac 
421 tatgatattg cccttttgaa gatggctgga gccttccaat ttggccactt tgtggggccc 
481 atatgtcttc cagagctgcg ggagcaattt gaggctggtt ttatttgtac aactgcaggc 
541 tggggccgct taactgaagg tggcgtcctc tcacaagtct tgcaggaagt gaatctgcct 
601 attttgacct gggaagagtg tgtggcagct ctgttaacac taaagaggcc catcagtggg 
661 aagacctttc tttgcacagg ttttcctgat ggagggagag acgcatgtca gggagattca 
721 ggaggttcac tcatgtgccg gaataagaaa ggggcctgga ctctggctgg tgtgacttcc 
781 tggggtttgg gctgtggtcg aggctggaga aacaatgtga ggaaaagtga tcaaggatcc 
841 cctgggatct tcacagacat tagtaaagtg cttccctgga tccacgaaca catccaaact 
901 ggtaatcgga gaaagagctc cagagcctgg tgcagtgagc aggatgtcat agtcagcggg 
961 gctgagggga agctgcactt cccagaaagc ctccacctat attatgagag caagcaacgg 
1021 tgtgtctgga ccctgctggt accagaggaa atgcatgtgt tgctcagttt ttcccaccta 
1081 gatgttgagt cttgtcacca cagttacctg tcaatgtatt ctttagaaga cagacccatt 
1141 ggaaaatttt gtggagaaag cctcccttca tccattctta ttggctctaa ttctctaagg 
12 01 ctgaaattcg tctctgatgc cacagattat gcagctgggt ttaatcttac ctataaagct 
1261 cttaaaccaa actacattcc tgattcaggt tgcagttact taactgtcct ttttgaagaa 
1321 ggtctcatac agagtctaaa ctatcctgaa aactacagtg acaaggctaa ctgtgactgg 
1381 atttttcaag cctccaaaca tcacctaatt aagctttcat ttcagagtct ggaaatagaa 
1441 gaaagtggag actgcacttc cgactatgtg acagtgcaca gcgatgtaga aaggaagaag 
1501 gaaatagctc ggctgtgtgg ctatgatgtc cccacccctg tgctgagccc ctccagcatc 
1561 atgctcatca gcttccaatc agatgaaaac gggacctgca ggggctttca ggctacagtc 
1621 tccttcattc ctaaagcagg taagaagata gagttaccaa ctctttggtt cccagttcta 
1681 atccttgtga tgtga 

// 
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Homo sapiens. Human CVSP14 full length cDNA. PA:CORVAS INT INC. 
PN:WO200277263-A2 . gsn 
Length = 1035 

Score = 628 bits (1603), Expect = e-179 

Identities = 305/306 (99%), Positives = 306/306 (100%), Gaps = 0/306 (0%) 
Frame = +1 

Query: 1 MSLKMLISRNKLILLLGIVFFERGKSAALSLPKAPSCGQSLVKVQPWNYFNIFSRILGGS 58 

MSLKMLISRNKLILLLGIVFFE+GKSAALSLPKAPSCGQSLVKVQPWNYFNIFSRILGGS 
Sbjct: 115 MSLKMLISRNKLILLLGIVFFEQGKSAALSLPKAPSCGQSLVKVQPWNYFNIFSRILGGS 294 

Query : 59 QVEKGSYPWQVSLKQRQKHICGGSIVSPQWVITAAHCIANRNIVSTLNVTAGEYDLSQTD 118 

QVEKGSYPWQVSLKQRQKHICGGSIVSPQWVITAAHCIANRNIVSTLNVTAGEYDLSQTD 
Sbjct: 295 QVEKGSYPWQVSLKQRQKHICGGSIVSPQWVITAAHCIANRNIVSTLNVTAGEYDLSQTD 474 

Query: 119 PGEQTLTIETVIIHPHFSTKKPMDYDIALLKMAGAFQFGHFVGPICLPELREQFEAGFIC 178 

PGEQTLTIETVIIHPHFSTKKPMDYDIALLKMAGAFQFGHFVGPICLPELREQFEAGFIC 
Sbjct: 475 PGEQTLTIETVIIHPHFSTKKPMDYDIALLKMAGAFQFGHFVGPICLPELREQFEAGFIC 654 

Query: 179 TTAGWGRLTEGGVLSQVLQEVNLPILTWEECVAALLTLKRPISGKTFLCTGFPDGGRDAC 238 

TTAGWGRLTEGGVLSQVLQEVNLPILTWEECVAALLTLKRPISGKTFLCTGFPDGGRDAC 
Sbjct: 655 TTAGWGRLTEGGVLSQVLQEVNLPILTWEECVAALLTLKRPISGKTFLCTGFPDGGRDAC 834 

Query: 239 QGD S GG S LMCRNKKGAWTLAGVTSWGLGCGRGWRNNVRKSDQG SPG I FTD I SKVLSWI HE 298 

QGDSGGSLMCRNKKGAWTLAGVTSWGLGCGRGWRNNVRKSDQGSPGIFTDISKVLSWIHE 
Sbjct: 835 QGDSGGSLMCRNKKGAWTLAGVTSWGLGCGRGWRNNVRKSDQGSPGIFTDISKVLSWIHE 1014 

Query: 299 HIQTGN 304 
HIQTGN 

Sbjct: 1015HIQTGN 1032 / 
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(84) 



ENG 

22 March 2001 (22.03.2001) 



US 
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20 March 2002 (20.03.2002) 
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No Image Available. 

Abstract 

Provided herein are polypeptides designated CVSP14 polypeptides that exhibit 
protease activity as a single chain or as an activated two chain form. Methods using 
the polypeptides to identify compounds that modulate the protease activity thereof 
are provides. The polypeptides also serve as tumor markers. 
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Homo sapiens. Human protease PRTS-20 cDNA sequence. PA:INCYTE GENOMICS INC. 
PN:WO200198468-A2 . gsn 
Length = 12 62 

Score = 628 bits (1603), Expect = e-179 

Identities = 305/306 (99%), Positives = 305/306 (99%), Gaps = 0/306 (0%) 
Frame = +3 

Query: 1 MSLKMLISRNKLILLLGIVFFERGKSAALSLPKAPSCGQSLVKVQPWNYFNIFSRILGGS 58 

MSLKMLISRNKLILLLGIVFFERGKSA LSLPKAPSCGQSLVKVQPWNYFNIFSRILGGS 
Sbjct: 165 MSLKMLISRNKLILLLGIVFFERGKSATLSLPKAPSCGQSLVKVQPWNYFNIFSRILGGS 344 

Query: 59 QVEKGSYPWQVSLKQRQKHICGGSIVSPQWVITAAHCIANRNIVSTLNVTAGEYDLSQTD 118 

QVEKGSYPWQVSLKQRQKHICGGSIVSPQWVITAAHCIANRNIVSTLNVTAGEYDLSQTD 
Sbjct: 345 QVEKGSYPWQVSLKQRQKHICGGSIVSPQWVITAAHCIANRNIVSTLNVTAGEYDLSQTD 524 

Query: 119 PGEQTLTIETVIIHPHFSTKKPMDYDIALLKMAGAFQFGHFVGPICLPELREQFEAGFIC 178 

PGEQTLTIETVIIHPHFSTKKPMDYDIALLKMAGAFQFGHFVGPICLPELREQFEAGFIC 
Sbjct: 525 PGEQTLTIETVIIHPHFSTKKPMDYDIALLKMAGAFQFGHFVGPICLPELREQFEAGFIC 704 

Query: 179 TTAGWGRLTEGGVLSQVLQEVNLPILTWEECVAALLTLKRPISGKTFLCTGFPDGGRDAC 238 

TTAGWGRLTEGGVLSQVLQEVNLPILTWEECVAALLTLKRPISGKTFLCTGFPDGGRDAC 
Sbjct: 705 TTAGWGRLTEGGVLSQVLQEVNLPILTWEECVAALLTLKRPISGKTFLCTGFPDGGRDAC 884 

Query: 239 QGDSGGSLMCRNKKGAWTLAGVTSWGLGCGRGWRNNVRKSDQGSPGIFTDISKVLSWIHE 298 

QGDSGGSLMCRNKKGAWTLAGVTSWGLGCGRGWRNNVRKSDQGSPGIFTDISKVLSWIHE 
Sbjct: 885 QGDSGGSLMCRNKKGAWTLAGVTSWGLGCGRGWRNNVRKSDQGSPGIFTDISKVLSWIHE 1064 

Query: 299 HIQTGN 304 

HIQTGN 
Sbjct: 1065HIQTGN 1082 
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ZW), Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ, CF, CG, CI, CM, 
GA, GN, GW, ML, MR, NE, SN, TD, TG) 
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Abstract 

The invention provides human proteases (PRTS) and polynucleotides which identify 
and encode PRTS. The invention also provides expression vectors, host cells, 
antibodies, agonists, and antagonists. The invention also provides methods for 
diagnosing, treating, or preventing disorders associated with aberrant expression of 
PRTS. 
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characterize the protein. A starting material that can only be used to produce 
a final product does not have a substantial asserted utility in those instances 
where the final product is not supported by a specific and substantial utility. 
In this case none of the proteins that are to be produced as final products 
resulting from processes involving the claimed cDNA have asserted or 
identified specific and substantial utilities. The research contemplated by 
Applicants to characterize potential protein products, especially their 
biological activities, does not constitute a specific and substantial utility. 
Identifying and studying the properties of the protein itself or the 
mechanisms in which the protein is involved does not defme a "real world" 
context of use. Note, because the claimed invention is not supported by a 
specific and substantial asserted utility for the reasons set forth above, 
credibility has not been assessed. Neither the specification as filed nor any 
art of record discloses or suggests any property or activity for the cDNA 
compounds such that another non-asserted utility would be well established 
for the compounds. 

Claim 1 is also rejected under 35 U.S.C. § 1 12, first paragraph. 
Specifically, since the claimed invention is not supported by either a specific 
and substantial asserted utility or a well established utility for the reasons set 
forth above, one skilled in the art would not know how to use the claimed 
invention. 

Example 10: DNA Fragment encoding a Full Open Reading Frame 
(ORF) 

Specification: The specification discloses that a cDNA library was prepared 
from human kidney epithelial cells and 5000 members of this library were 
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sequenced and open reading frames were identified. The specification 
discloses a Table that indicates that one member of the library having SEQ 
ID NO: 2 has a high level of homology to a DNA ligase. The specification 
teaches that this complete ORF (SEQ ID NO: 2) encodes SEQ ID NO: 3. 
An alignment of SEQ ID NO: 3 with known amino acid sequences of DNA 
ligases indicates that there is a high level of sequence conservation between 
the various known ligases. The overall level of sequence similarity between 
SEQ ID NO: 3 and the consensus sequence of the known DNA ligases that 
are presented in the specification reveals a similarity score of 95%. A search 
of the prior art confirms that SEQ ID NO: 2 has high homology to DNA 
Ligase encoding nucleic acids and that the next highest level of homology is 
to alpha-actin. However, the latter homology is only 50%. Based on the 
sequence homologies, the specification asserts that SEQ ID NO: 2 encodes a 
DNA ligase. 

Claim 1; An isolated and purified nucleic acid comprising SEQ ID NO: 2. 

Analysis: The following analysis includes the questions that need to be 
asked according to the guidelines and the answers to those questions based 
on the above facts: 

1) Based on the record, is there a "well established utility" for the 
claimed invention? Based upon applicant's disclosure and the results of the 
PTO search, there is no reason to doubt the assertion that SEQ ID NO: 2 
encodes a DNA ligase. Further, DNA ligases have a well-established use in 
the molecular biology art based on this class of protein's ability to ligate 
DNA. Consequently the answer to the question is yes. 
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Note that if there is a well-established utility already associated with the 
claimed invention, the utility need not be asserted in the specification as 
filed. In order to determine whether the claimed invention has a well- 
established utility the examiner must determine that the invention has a 
specific, substantial and credible utility that would have been readily 
apparent to one of skill in the art. In this case SEQ ID NO: 2 was shown to 
encode a DNA ligase that the artisan would have recognized as having a 
specific, substantial and credible utility based on its enzymatic activity. 

Thus, the conclusion reached from this analysis is that a 35 U.S. C. § 
101 rejection and a 35 U.S.C. § 1 12, first paragraph, utility rejection should 
not be made. 

Example 11: Animals with Uncharacterized Human Genes 

Specification: Kidney cells from a patient with Polycystic Kidney (PCK) 
Disease have been used to make a cDNA library. From this library 8000 
nucleotide "fragments" have been sequenced but not yet used to express 
proteins in a transformed host cell nor have they been characterized in any 
other way. The 50 longest fragments, SEQ ID NO: 1-50, respectively, have 
been used to make transgenic mice. None of the 50 lines of mice have 
developed Polycystic Kidney Disease to date. The asserted utility is the use 
of the mice to research human genes from diseased human kidneys. The 
disease is inheritable, but chromosomal loci have not yet been identified. 
Neither the absence or presence of a specific protein has been identified with 
the disease condition. 
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Query= SEQ ID N0:1 

(921 letters) 



Score E 

Sequences producing significant alignments: (bits) Value 

AC104237. 2. 1.164732 391 e-106 

>AC104237. 2. 1,164732 

Length = 164732 

Score = 391 bits (197), Expect = e-106 
Identities = 200/201 (99%) 
Strand = Plus / Plus 

Query: 721 cagggagattcaggaggttcactcatgtgccggaataagaaaggggcctggactctggct 780 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Sbjct: 51293 cagggagattcaggaggttcactcatgtgccggaataagaaaggggcctggactctggct 51352 
Query: 781 ggtgtgacttcctggggtttgggctgtggtcgaggctggagaaacaatgtgaggaaaagt 840 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIIIMIIIIIMI 

Sbjct: 51353 ggtgtgacttcctggggtttgggctgtggtcgaggctggagaaacaatgtgaggaaaagt 51412 
Query: 841 gatcaaggatcccctgggatcttcacagacattagtaaagtgctttcctggatccacgaa 900 

IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIII llllllllllllll 

Sbjct: 51413 gatcaaggatcccctgggatcttcacagacattagtaaagtgcttccctggatccacgaa 51472 
Query: 901 cacatccaaactggtaactaa 921 

MIIIIIMIIMIIIIIIII 

Sbjct: 51473 cacatccaaactggtaactaa 51493 



Score = 349 bits (176), Expect = le-93 
Identities = 176/176 (100%) 
Strand = Plus / Plus 

Query: 301 agaaacattgtgtctactttgaatgttactgctggagagtatgacttaagccagacagac 3 60 

IIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIM 

Sbjct: 49450 agaaacattgtgtctactttgaatgttactgctggagagtatgacttaagccagacagac 49509 
Query: 361 ccaggagagcaaactctcactattgaaactgtcatcatacatccacatttctccaccaag 420 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

Sbjct: 49510 ccaggagagcaaactctcactattgaaactgtcatcatacatccacatttctccaccaag 49569 
Query: 421 aaaccaatggactatgatattgcccttttgaagatggctggagccttccaatttgg 476 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Sbjct: 49570 aaaccaatggactatgatattgcccttttgaagatggctggagccttccaatttgg 49625 



Score = 308 bits (155), Expect = 5e-81 
Identities = 155/155 (100%) 
Strand = Plus / Plus 

Query: 570 aggtggcgtcctctcacaagtcttgcaggaagtgaatctgcctattttgacctgggaaga 629 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Sbjct: 50304 aggtggcgtcctctcacaagtcttgcaggaagtgaatctgcctattttgacctgggaaga 50363 
Query: 630 gtgtgtggcagctctgttaacactaaagaggcccatcagtgggaagacctttctttgcac 689 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Sbjct: 50364 gtgtgtggcagctctgttaacactaaagaggcccatcagtgggaagacctttctttgcac 50423 
Query: 690 aggttttcctgatggagggagagacgcatgtcagg 724 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Sbjct: 50424 aggttttcctgatggagggagagacgcatgtcagg 50458 



Score = 226 bits (114), Expect = 2e-56 
Identities = 114/114 (100%) 
Strand _= Plus / Plus 

Query: 99 agctcccagttgtgggcagagtctggttaaggtacagccttggaattattttaacatttt 158 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Sbjct: 47105 agctcccagttgtgggcagagtctggttaaggtacagccttggaattattttaacatttt 47164 
Query: 159 cagtcgcattcttggaggaagccaagtggagaagggttcctatccctggcaggt 212 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Sbjct: 47165 cagtcgcattcttggaggaagccaagtggagaagggttcctatccctggcaggt 47218 



Score = 197 bits (99), Expect = le-47 
Identities = 99/99 (100%) 
Strand = Plus / Plus 

Query: 475 ggccactttgtggggcccatatgtcttccagagctgcgggagcaatttgaggctggtttt 534 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Sbjct: 49969 ggccactttgtggggcccatatgtcttccagagctgcgggagcaatttgaggctggtttt 50028 
Query: 535 atttgtacaactgcaggctggggccgcttaactgaaggt 573 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Sbjct: 50029 atttgtacaactgcaggctggggccgcttaactgaaggt 50067 



Score = 191 bits (96), Expect = 9e-46 
Identities = 98/100 (98%) 
Strand = Plus / Plus 



Query: 1 




Sbjct: 45375 atgagtctcaaaatgcttataagcaggaacaagctgattttactactaggaatagtcttt 45434 



Sbjct: 45435 tttgaacgaggtaaatctgcaactctttcgctccccaaag 45474 

Score = 187 bits (94), Expect = le-44 
Identities = 94/94 (100%) 
Strand = Plus / Plus 

Query: 209 aggtatctctgaaacaaaggcagaagcatatttgtggaggaagcatcgtctcaccacagt 2 68 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Sbjct: 47990 aggtatctctgaaacaaaggcagaagcatatttgtggaggaagcatcgtctcaccacagt 48049 
Query: 269 gggtgatcacggcggctcactgcattgcaaacag 3 02 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Sbjct: 48050 gggtgatcacggcggctcactgcattgcaaacag 48083 



Query: 61 
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AC104237 164732 bp DNA linear PRI 30-APR-2002 

Homo sapiens chromosome 11, clone RP11-35J10, complete sequence. 
AC104237 

AC104237 .2 GI: 2 0128277 
HTG. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini ; Hominidae; Homo. 

1 (bases 1 to 164732) 

Birren,B., Linton, L., Nusbaum,C. and Lander, E. 
Homo sapiens chromosome 11, clone RP11-35J10 
Unpublished 

2 (bases 1 to 164732) 

Birren,B., Linton, L., Nusbaum,C., Lander, E., Ali,A., Allen, N., 
Anderson, S., Barna,N., Bastien,V., Boguslavkiy , L . , Boukhgalter , B . , 
Brown, A., Camarata,J., Campopiano , A . , Chang, J., Chazaro,B., 
Choepel,Y., Colangelo, M. , Collins, S., Collymore,A. , Cook, A., 
Cooke, P., DeArellano, K. , Dewar,K., Diaz, J. S., Dodge, S., Faro, S . , 
Ferreira,P., FitzHugh,W. , Gage,D., Galagan,J., Gardyna,S., 
Ginde,S., Gord,S., Goyette,M., Graham, L., Grand-Pierre, N. , 
Hagos,B., Heaford,A., Horton,L., Hulme,W. , Iliev,I., Johnson, R., 
Jones, C, Kamat,A., Karatas,A. , Kells,C., LaRocque,K., 
Lamazares,R. , Landers, T., Lehoczky,J., Levine,R., Liu,G., 
MacLean,C., Macdonald, P . , Major, J., Marquis, N., Matthews, C, 
McCarthy, M., McEwan,P., McKernan,K., McPheeters , R. , Meldrim,J., 
Meneus,L., Mihova,T., Mlenga,V., Murphy, T., Naylor,J., Nguyen, C, 
Norbu,C., Norman, C.H., 0 ' Connor, T., O ' Donnell , P . , 0'Neil,D., 
Oliver, J., Peterson, K., Phunkhang , P . , Pierre, N., Pollara,V. , 
Raymond, C, Retta,R., Rieback,M., Riley, R., Rise,C., Rogov,P., 
Roman, J., Rosetti,M., Roy, A., Santos, R., Schauer,S., Schupback, R . , 
Seaman, S., Severy,?., Spencer, B., Stange-Thomann, N. , Sto janovic , N. , 
Strauss,N., Subramanian, A . , Talamas,J., Tesfaye,S., Theodore, J., 
Topham,K., Travers,M., Travis, N., Trigilio,J., Vassiliev, H . , 
Viel,R., Vo,A., Wilson, B., Wu,X., Wyman,D., Ye,W.J., Young, G., 
Zainoun,J., Zembek,L., Zimmer,A. and Zody,M. 
Direct Submission 

Submitted (06-DEC-2001) Whitehead Institute/MIT Center for Genome 
Research, 320 Charles Street, Cambridge, MA 02141, USA 

3 (bases 1 to 164732) 

Birren,B., Linton, L., Nusbaiam,C., Lander, E., Ali,A., Allen, N., 
Anderson, S., Barna,N., Bastien,V. , Bloom, T., Boguslavkiy , L . , 
Boukhgalter, B. , Brown, A. , Camarata,J., Campopiano , A. , Chang, J., 
Chazaro,B., Choepel,Y., Colangelo, M. , Collins, S., Collymore,A. , 
Cook, A., Cooke, P., DeArellano, K. , Dewar,K., Diaz, J. S., Dodge, S., 
Faro,S., Ferreira,P., FitzHugh,W., Gage,D., Galagan,J., Gardyna,S., 
Ginde,S., Gord,S., Goyette,M., Graham, L., Grand-Pierre, N. , 
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